An Efficient Method for Removing Deletion Errors in Quickly-spoken Connected Mandarin Digit String Speech Recognition

نویسندگان

  • Chunyi Guo
  • Runzhi Li
  • Kejun Liu
چکیده

Connected Mandarin digit string speech, especially at rapid spoken rate, is very difficult to recognize correctly. In this paper, a new training method named neighboring digits pattern is proposed in order to eliminate most of deletion errors which frequently occur in Mandarin digits speech recognition at high speaking rate when we have enough quickly-spoken speech data as the training set. The complete implementation process and the corresponding data analysis are presented detailed and the performance is compared with that of the conventional system through the experiments. The results of comparison explain that the new method can reduce the deletion errors effectively, and thus improves the system recognition rate from 96.4% to 98.3%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neighboring Digits Pattern Training Method in Quickly-spoken Connected Mandarin Digits Speech Recognition

Deletion errors are most usually occurred in connected Mandarin digit string speech recognition when speaking rate is fast, and are the main reasons leading to the increasing of the recognition error rate and the decline of the recognition accuracy. In this paper, a new training method named neighboring digits pattern is given based on sufficient statistics of recognition errors of the traditio...

متن کامل

Duration Modeling in Mandarin Connected Digit Recognition

Digit string recognition is required in many applications which need to recognize numbers such as telephone numbers, credit card numbers, date, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there e...

متن کامل

Improvement in Connected Mandarin Digit Recognition by Explicitly Modeling Coarticulatory Information

The most successful training scheme for recognition of connected spoken digits is the segmental k-means algorithm, which implicitly captures the coarticulatory information of connected speech iteratively to establish reliable reference patterns. However, when this algorithm is applied to Mandarin digits, the obtained performance is inferior to that of English. Hence, a novel approach is propose...

متن کامل

Performance of Mandarin Connected Digit Recognizer with Word Duration Modeling

Digit string recognition is required in many applications such as automatic banking system, database information retrieving system, etc. In order to design a high performance recognizer, duration information is explored in this study. In a Mandarin connected digit recognizer, insertion and deletion errors amount to more than two thirds of the total recognition errors because there exist two mon...

متن کامل

An embedded word training procedure for connected digit recognition

The "conventional" way of obtaining word reference patterns for connected word recognition systems is to use isolatàd word patterns, and to rely on the dynamics of the matching algorithm to account for the differences in connected speech. Connected word recognition, based on such an approach, tends to become unreliable (high error rates) when the talking rate becomes grossly incommensurate with...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010